SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT : Ryals, John 

Delaney, Terry 
Friedrich, Leslie 
Weymann, Kristianna 
Law ton, Kay 
Ellis, Daniel 
Uknes, Scott 
Jesse, Taco 
Vos, Pieter 

(ii) TITLE OF INVENTION: GENE ENCODING A PROTEIN INVOLVED IN THE SIGNAL 
TRANSDUCTION CASCADE LEADING TO SYSTEMIC ACQUIRED RESISTANCE IN PLANTS 

(iii) NUMBER OF SEQUENCES: 17 

( iv) CORRESPONDENCE - ADDRESS : 

(A) ADDRESSEE: Novartis Corporation 

J* (B) STREET: 520 White Plains Road, P.O. Box 2005 

fi (C) CITY: Tarry town 

f£j (D) STATE: New York 

H (E) COUNTRY : USA 

N" (F) ZIP: 10591 

f| (v) COMPUTER READABLE FORM: 

P (A) MEDIUM TYPE: Floppy disk 

f*j (B) COMPUTER: IBM PC compatible 

01 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

H (vi) CURRENT APPLICATION DATA: 

jtt- (A) APPLICATION NUMBER: 

M 8 (B) FILING DATE : 

j|J (C) CLASSIFICATION: 

if 8 "?. 

Si (viii) ATTORNEY / AGENT INFORMATION: 

K (A) NAME: Meigs, J. Timothy 

(B) REGISTRATION NUMBER: 38,241 

(C) REFERENCE /DOCKET NUMBER: CGC 1909 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (919) 541-8587 

(B) TELEFAX: (919) 541-8689 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TGATCATGAA TTGCGTGTAG GGTTGTGTTT TAAAGATAGG GATGAGCTGA AGAAGGCGGT 60 

GGACTGGTGT TCCATTAGAG GGCAGCAAAA GTGTGTAGTA CAAGAGATTG AGAAGGACGA 120 

GTATACGTTT AAATGCATCA GATGGAAATG CAATTGGTCG CGTCGGGCAG ATTGAATAGA 180 

AGAACATGGA CTTGTTAAGA TAACTAAGTG TAGTTGGTCC ACATACTTGT TGTTCTATTA 240 

AGCCGGAAAA CTTCAACTTG TAATTTGCAG CAGAAGAGAT TGAGTGTCTG ATCAGGGTAC 300 

AACCCACTCT AACAGCAGAG TTGAAAAGTT TGGTGACATG CTTAAAACTT CAAAGCTGCG 360 

GGCAGCAGAA CAGGAAGTAA TCAAAGATCA GAGTTTCAGA GTATTGCCTA AACTAATTGG 420 

CTGCATTTCA CTCATCTAAT GGGCTACTTG TGGACTGCAA TATGAGCTTT TCCCTAATCC 480 

TGAATTTGCA TCCTTCGGTG GCGCGTTTTG GGCGTTTCCA CAGTCCATTG AAGGGTTTCA 540 

ACACTGTAGA CCTCTGATCA TAGTGGATTC AAAAGACTTG AACGGCAAGT ACCCTATGAA 600 

0 ATTGATGATT TCCTCAGGAC TCGACGCTGA TGATTGCTTT TTCCCGCTTG CCTTTCCGCT 660 

TACCAAAGAA GTGTCCACTG ATAGTTGGCG TTGGTTTCTC ACTAATATCA GAGAGAAGGT 720 

10 AACACAAAGG AAAGACGTTT GCCTCGTCTC CAGTCCTCAC CCGGACATAG TTGCTGTTAT 780 

TAACGAACCC GGATCACTGT GGCAAGAACC TTGGGTCTAT CACAGGTTCT GTCTGGATTG 840 

TTTTTGCTTA CAATTCCATG ATATTTTTGG AGACTACAAC CTGGTGAGCC TTGTGAAGCA 900 

GGCTGGATCC ACAAGTCAGA AGGAAGAATT TGATTCCTAC ATAAAGGACA TCAAAAAGAA 960 

GGACTCAGAA GCTCGGAAAT GGTTAGCCCA ATTCCCTCAA AATCAGTGGG CTCTGGCTCA 1020 

TGACCAGTGG TCGGAGATAT GGAGTCATGA CGATAGAAAC AGAAGATTTG AGGGCAATTT 1080 

GTGAAAGCTT TCAGTCTCTT GGTCTATCAG TGACAGCGAA CGCACCTGCA CATGTGGGAA 1140 

GTTTCAATCG AAGAAGTTTC CATGTATGCA CCCAGAAATG GTGCAAAGGA TTGTTAACTT 1200 

GTGTCATTCA CAAATGTTGG ATGCAATGGA GCTGACTAGG AGAATGCACC TTACACGCCC 1260 

ACTCAGTGTT CTCTTATCTC TAGACCTGAA ACTAACTTGC TGTGTAATTC GAGTTACAAA 1320 

AGGTTAAAGG AAGAATTAGG AAGATACATA TAACATGAAT GTTGCCAGAA GTTCAGGGAA 1380 

CTTGAATATT CTTTTGGTTC TTGGTGGAAA ATATCCAACA GATGAACAAT TTGACATTAT 1440 

TTCACACTTT GATTCTAGCA ACTCTGTAAC ACCATCATGG GTTATTGTTG ATGTACATAA 1500 

ATATATATTA CAAATCTGTA TACCATTGGT TCAAATTGTT ACAACATTTG TTTGAAGCAC 1560 

ACCTGCAGCA ATAATACACA GGATGCAAAA CGAAGAGCGA AACTATATGA CGCCAACGAT 1620 

AGACATAAAC AGTTACAGTC ATCATGAAAA CAGAATTATA TGGTACAGCA AAAATTACAC 1680 

TAAGAGGCAA GAGTCTCACC GACGACGATG AGAGAGTTTA CGGTTAGACC TCTTTCCACC 1740 

GGTTGATTTC GATGTGGAAG AAGTCGAATC TGTCAGGGAC GAATTTCCTA ATTCCAAATT 1800 

GTCCTCACTA AAGGCCTTCT TTAGTGTCTC TTGTATTTCC ATGTACCTTT GCTTCTTTTG 1860 
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TAGTCGTTTC TCAGCAGTGT CGTCTTCTCC GCAAGCCAGT 
CATAATCTGG TCGAGCACTG CCGAACAGCG CGGGAAGAAT 
TGATAAAAAA AACAAGGTCA GACAGCAAGT AACAAAACCA 
TTGTTTTTTG TGATAAGGAG TCCGATGAAG TGGGTGAGAA 
CGCTTTTAGT CTACTTTGAT GCTCTTCTAG GATTCTGAAA 
TGATGTTCTC TTCGTACCAG TGAGACGGTC AGGCTCGAGG 
TGTTCCCTTC ATTTCGGCGA TCTCCATTGC AGCTTGTGCT 
AGCAAGTGCA ACTAAACAGT GGACGACACA AAGAATAGTT 
CCTAATAGAG AGGACATAAA TTTAATTCAA ACATATAAGA 
TCTATTTTCA AGATCGAGCA GCGTCATCTT CAATTCATCG 
AGGAACATCT CTAGGAATTT GTTCTCGTTT GTCTTCTTGC 
TCGGCCTTTG AGAGAATGCT TGCATTGCTC CGGGATATTA 
GGCTTGTTTT GCGATCATGA GTGCGGTTCT ACCTTCCAAA 
ACCTTTTTCC AATAGAGATA GTATCAATTG TGGCTCCTTC 
CACCGTATAT CCCCTCGGAT TCCTATGGTT GACATCGGCA 
TGTTGCGGTC TTCACATTGC AATATGCAAC AGCGAAATGA 
ATTGGTGTGA TCCTCTTTCA AAAGCAACTT GACTAACTCA 
CTTATGTACA TTCGAGACAT GTTTCTTTAC TTTAGGTACC 
TCTATCAATT ATCTCTTTAA CAAGCTCTTC CGGCAATGAC 
TACATTAGAC TTGACAATAA TCTCTTTACA TCTATCCAAT 
ACATATATTA GCAAGCTTGA GTATAACCAA TGTGTCCTCT 
GTCCAATAAG TGCCTCTGAA ATACAAATAC AAGTACTCAA 
GTGTAACCAT AGCTTAATGC AGATGGTGTT TTACCTGATA 
TCTTGAAGAT GAAAGCCAAA TAGAGAACCT CCAACATGAA 
CCACGTGGCA GCAATTCTCG TCTGCGCATT CAGAAACTCC 
TGCTGCTGTA AACATAAGCC AAAACAGTCA CAACCGAATC 
TGGCAATCTC CTTAAGCTCG AGCTTCACGG CGGCGGTGTT 
TAGCGGCGGC TAAAGCGCTC TTGAAGAAAG AGCTTCTCGC 
AAGAAACTTC CCGGCCGTCG GAGAGAACAA GCTTAGCGTC 
AGTCAAAGAC GGATTCGAAG CTGTTGGAGA GCAATTGCAG 
TGAGTACTTG TTCGGCGGCC AGATAAACAA TAGAGGAGTC 
AACTAGTGCT GCTGATTTCA TAAGAATCGG CGAATCCATC 
GGTTCCGATG AATTGAAATT CACAAATTAA AGAGATCTCT 
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TGAGTCAAGT 
CGTTTCCCGA 
TGTTTAAAGA 
TCCATACCGG 
GGTGCTATCT 
CTAGTCACTA 
TCCGTTGGAA 
ATCATTAGTT 
AATAAGACTT 
GCCGCCACTG 
TCTAGTATTT 
TTACATTCAA 
GTTGCTTCTG 
CGCATCGCAG 
AGATCAAGTT 
AGAGCACACG 
ATATCATCCG 
TCCAAACCAA 
TTTTCAAGAC 
AGCTTCATAC 
ATAACAACTT 
GTAAGAACAT 
GAGAGTAATT 
ATCCACCGCC 
TTTAGGCGGC 
GAAACCGACT 
GTTGGAGTCT 
TGACAAAACG 
GCTGTAGAAA 
AGCAGATACA 
GGTGTTATCG 
AATGGTGGTG 
GCTAATCAAC 



CCTCACAGTT 

GTTCCACTGA 

TCATTTAGTT 

TTTTAGAAAG 

TTACACCCGG 

TGAACTCACA 

AAAGACGTTG 

CACTCAGTTT 

GATAGATACC 

CAAAAGAGGG 

CTACACATAG 

CCGCCATAGT 

ATGCACTTGC 

CAACATGAAG 

TTAAAAGATC 

CATCATCTAG 

AGTCAAGTGC 
GCTCTTTACG 

TAACCATATC 
AAGCTTTACC 
TGTCTACAAC 
ATTCATGAAT 
AATTCAGGGA 
GGCCGGCAAG 
GGTCTCACTC 
TCGTAATCCT 
TTCTCCTTCT 
CACCGGTGGA 
TCATCCGGCG 
TCAGGTCCGG 
GTAGCGACGA 
TCCATCAACA 
GAAGAGACCT 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 



IT* 



ft 

iL.,,i: 
;s:; ;: 



TATCAACTGG ATTTGGTTAA AGATCGAAGA TAACCATTGA 
CAACGAGAGT GGTGGTGAGA TATGAAGAAG CATCCTCGTC 
AAACCGGTAA ATTTCCAGGA AAGGAATCTT TGTCAGAGAT 
AGGAAGCTAA ACCGGTTCGG GTTATAAATG TTAGTATTTA 
GCTAATTTTT GTATATGAGA AGTTCAATCC GGTTCGGTAA 
TTGGAGATGA TATAAATATA TAAAATTTAT TTTTCATCCG 
TATATAAATA TTATTTTTTA AATTTAAGAA TTAGATTTAC 
TTTATTTTCT TTGAAGTAAA ATGATAAAGG GAACGTATAT 
CATAAGTTTT GTAATGTATA TTATATTTTT CGTTTATTGA 
AGCATGTTTA CACTATAATT AAATCAAGTC GAATATTTCC 
TATAGCAAAT GAAAACGCTC TTCACAACAA AATCATTATA 
TAAAAACATG AAAGTCATAA TGAATATATT TTTTTAATTA 
TATTGTATAC ATATAAAAGA CTTCTTTAGT TATTTGCCTT 
TGCGATAAAT CAGCTTTTTC AATAACTACG ACGTAAAAGC 
CAAATTTGGC TCATCCTTCA CTTGATTGGT GTTTTCCGGA 
AGAAGAAGAA GGAATCTGCA TAATCACCTC TTGGTTCCTC 
GGATCGAAAA CGATCGAGAT CAGAAAATGA AAAGATAGGT 
ACAACGTAAG ATTATGTTGA ATAAACAGAG TACTTTATAT 
TAAATTATTG CTTTCCGCGT TTTTTACTTT TGTATTTCTT 
ATAAGATTTG TATGATTTTA AGTAAATTTA CAATAACTCT 
ATATTTAATT AATTTTACTA ATTATCTTTT GAACAATTTT 
TTAATTTTTT AAAATGATAT ATTATAAAAT TTAATTGAAT 
ATCTTCTACC ATCTATTATA GTTGATAAAT ATTGTGATAA 
TGCCAAATAT TTAATAAATT TTGTGTACCA TGCGTTTTTT 
GACAGCATAC CGTACATATA TTGTATAAAA GCTTATAAAA 
GTAAGCTATA AATATATGTA AACAATAGTA AGATATTACG 
TGCTTTAGAT ATTATGTATA TCTAATATAT TAAAATATCT 
TTAAGAGAGA AAATTGGGAC ACTATTTTCT ATACAGTAAC 
AACCCTTGAT ATAATAAAAT AACTAGCCAA AAAATCAGAT 
TTTGGTATTA TTACATAAAC CTAAGAAACA AAATTCAATA 
AACAATTAAA CATCACTAGA TATATTTATG CCCCACAATG 
AGACTTGAGA TCCTTGTCAA CTACGTTTGC ATTTGTCGGC 
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CGAGCAGAGC CAAGTCAAGT 3900 

CCACGGTTTA CATTTCACCA 3960 

CTTTTTTAAA AAGATATAAC 4020 

TACCGGAGAC ATTTTGTGTT 4080 

GCCCCTGAAC CAAACTAGAT 4140 

GTTCGTTATT TTCATATAAA 4200 

ATGTGAAAGT TACATTTCTG 4260 

TAAGTTTCAT GCTTTATTCA 432 0 

AAAAGTAATT TTCAGTGTTC 4380 

TGGAACTATT CTCCTTGTTC 4440 

GATATAGGAA TAAATTACAT 4500 

GGATTTGATT TAAAAACAAT 4560 

CAACTTCTCG TTCTGAATCA 4620 

AAATTCATAA CACGTCTAAA 4680 

CTCGATGTTG CTGGAAACTG 4740 

ACCGGTAGAC TCATTTTGTT 4800 

TAAAGATGCC TATGAATACA 4860 

AGGAGTTATA ATAAGGTAAA 4920 

AAATGATAAG TTAAATTAGG 4980 

CTATAACTCA ATAGCATCAC 5040 

ATGAAATAGT TTTCTTTTAA 5100 

CAATCTGATA TAATTTTTTT 5160 

ACTTTAGATA AACACCCAAT 5220 

TTGGAGAATA TATATACGTG 5280 

CATAGATACG GGTTATATTG 5340 

TGTTGTGTCT AAATATGTGT 5400 

TTTATTAACT AATATATTAT 5460 

TGTTTTCAAC TATAAACAGG 5520 

TAAATATTCA TAAAACAATG 5580 

TTCCTTTTTA CCTTATAAAA 5640 

AGCGAGCCAA TTGAGACTTG 5700 

CCATTTTTTT TATTTTTTTT 5760 
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TTAAAGTGTC GGCCCGTTGC TTCTTCCGTT CAGATCAACC CTCTCGTAAT CAGAACAAAA 
CGGAAAACAA ACGAAAGAAC AATCAGATCC CTCTTTTTTT GCATAAACTA AATTCAACTT 
CTCTGCGTTT ATGTTGTAGA GGCAACCACG ATCACTACTA CGAAACAATA CAACGTCGTT 
GCTTGGAGTC CACGTAATCA AATCTACTCC AATGCTTTTA ATATCTTTCA CTTTAACCCA 
CGACTTTTCA AAACTGCTCT TTAAAACCCA TAACTCGTGA ACATCTTCTT GATCTTTGTT 
TGTCCACTGA CGAATAGCAC CTAGCTTCCC TTCGTATCTG ACTAATCCTG AGAAAACATC 
AGAGTTCGGA GTATGGAAGA AGGACCAAGT TTCGGTTTTG AGACAAAACC GGATCACATT 
GTTGTTCCGT GATATCCAAT GCAAGAACCC CGAAACTTGT ATCGGGTTGG AAAAAATTAA 
TCTGTCTGTT TTTGGTAGAC GCAAATTTTC TAATCTCTTC CAGGTAAACG AATCAGAATC 
GAAAACTTCG CACATAAAAG TTCTGTGATT CAAATGGTAG ATACCCCGAG ACATACACAT 
ACGCCGAGAC TGCGAAAGCC TTTGTATTTT ATACCGGAAA GGGTTCAATC CGATTACCGC 
TAAACCCAAT GACATATCCC AACCCTTCAC TTCTGGCTTT GGTATGACCT GATACTGTTT 
AGTGGTTGGT TTGAAGACTA TGTATCCACG TGATGGTTTT GTATACTTAA CACAAAGCAA 
TATCCCATGA CTTGCATCAC AAGCTTCGAT CTTTATCATT CCGGGTGGCA GAAAGTCGAT 
GGAGACTCCA TTGTTTTGTA AATCACTCCT CTCATGGACA AAACTGGTTC GAAGTTCGTG 
TCCTTTTACT ATGTAGTGTT GTATGAAGTA TCCCGAAATA CGATTGGTTC TAAGGAGATT 
AAGATTGACA AACCATGACT CGTAGCTTCT CTTGTTGCAC TCTTTATTCA GGAGCCTGAA 
TTTTCCGATT TTTGACGCCG GAAGATAAGA AAGAAATTCT TGGATCATGT CTTGATTTAT 
CACCGGAGAA CTCATGATCC TGTCGGGAAT AAAGAGATGA GCACGATCAC TGAATGAGAA 
ATGAAAAAAT CAGGATCGGT AGAGAACAAC TTATGATGAA TAAAGTGTTT ATATATCCTT 
TCTTTGTTTA AGGAAAGTAT CAAAATTTGC CTTTTTCTTC GCTAGTCCTA AAACAAACAA 
ATTAACCAAA AGATAAAATC TTTCATGATT AATGTTACTT GTGATACCTT AAGCCAAAAC 
TTTATCTTTA GACTTTTAAC CAAATCTACA GTAATTTAAT TGCTAGACTT AGGAAACAAC 
TTTTTTTTTT ACCCAACAAT CTTTGGATTT TAATTGTTTT TTTTTCTACT AATAGATTAA 
CAACTCATTA TATAATAATG TTTCTATCAT AATTGACAAT TCTTTCTTTT TAATAAACAT 
CCAGCTTGTA TAATAATCCA CAAGTCAATT TCACCATTTT GGCCAATTTA TTTTCTTATA 
AAAATTAGCA CAAAAAAGAT TATCATTGTT TAGCAGATTT AATTTCTAAT TAACTTACGT 
AATTTCCATT TTCCATAGAT TTATCTTTCT TTTTATTTCC TTAGTTATCT TAGTACTTTC 
TTAGTTTCCT TAGTAATTTT AAATTTTAAG ATAATATATT GAAATTAAAA GAAGAAAAAA 
AACTCTAGTT ATACTTTTGT TAAATGTTTC ATCACACTAA CTAATAATTT TTTTTAGTTA 
AATTACAATA TATAAACACT GAAGAAAGTT TTTGGCCCAC ACTTTTTTGG GATCAATTAG 
TACTATAGTT AGGGGAAGAT TCTGATTTAA AGGATACCAA AAATGACTAG TTAGGACATG 
AATGAAAACT TATAATCTCA ATAACATACA TACGTGTTAC TGAACAATAG TAACATCTTA 
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5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 



TTTTAGCTTT ATGAAAAAAA ATATCAAATC ACTGAAGACA TTTGTTGGCC TATACTCTAT 9720 

TTTTTATTTG GCCAATTAGT AATAGACTAA TAGTAACTCA TATGATATCT CTCTAATTCT 9780 

GGCGAAACGA ATATTCTGAT TCTAAAGATA GTAAAAATGA ATTTTGATGA AGGGAATACT 9840 

ATTTCACACA CCTAGAAAGA GTAAGGTAGA AACCTTTTTT TTTTTGGTCA GATTCTTGTA 9900 

TCAAGAAGTT CTCATCGAT 9919 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5655 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 2787.. 3347 

(D) OTHER INFORMATION: /product= "1st exon of NIM1 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3427.. 4162 

(D) OTHER INFORMATION: /product= "2nd exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 4271.. 4474 

(D) OTHER INFORMATION: /product= "3rd exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 4586.. 4866 

(D) OTHER INFORMATION: /product= "4th exon of NIMl" 
(ix) FEATURE: 

(B) LOCATION! join(2787..3347, 3427.. 4162, 4271.. 4474, 4586.. 4866) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TGTGATGCAA GTCATGGGAT ATTGCTTTGT GTTAAGTATA CAAAACCATC ACGTGGATAC 60 

ATAGTCTTCA AACCAACCAC TAAACAGTAT CAGGTCATAC CAAAGCCAGA AGTGAAGGGT 120 

TGGGATATGT CATTGGGTTT AGCGGTAATC GGATTGAACC CTTTCCGGTA TAAAATACAA 180 

AGGCTTTCGC AGTCTCGGCG TATGTGTATG TCTCGGGGTA TCTACCATTT GAATCACAGA 240 

ACTTTTATGT GCGAAGTTTT CGATTCTGAT TCGTTTACCT GGAAGAGATT AGAAAATTTG 300 

CGTCTACCAA AAACAGACAG ATTAATTTTT TCCAACCCGA TACAAGTTTC GGGGTTCTTG 3 60 

CATTGGATAT CACGGAACAA CAATGTGATC CGGTTTTGTC TCAAAACCGA AACTTGGTCC 420 
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TTCTTCCATA CTCCGAACTC TGATGTTTTC TCAGGATTAG TCAGATACGA AGGGAAGCTA 
GGTGCTATTC GTCAGTGGAC AAACAAAGAT CAAGAAGATG TTCACGAGTT ATGGGTTTTA 
AAGAGCAGTT TTGAAAAGTC GTGGGTTAAA GTGAAAGATA TTAAAAGCAT TGGAGTAGAT 
TTGATTACGT GGACTCCAAG CAACGACGTT GTATTGTTTC GTAGTAGTGA TCGTGGTTGC 
CTCTACAACA TAAACGCAGA GAAGTTGAAT TTAGTTTATG CAAAAAAAGA GGGATCTGAT 
TGTTCTTTCG TTTGTTTTCC GTTTTGTTCT GATTACGAGA GGGTTGATCT GAACGGAAGA 
AGCAACGGGC CGACACTTTA AAAAAAAAAT AAAAAAAATG GGCCGACAAA TGCAAACGTA 
GTTGACAAGG ATCTCAAGTC TCAAGTCTCA ATTGGCTCGC TCATTGTGGG GCATAAATAT 
ATCTAGTGAT GTTTAATTGT TTTTTATAAG GTAAAAAGGA ATATTGAATT TTGTTTCTTA 
GGTTTATGTA ATAATACCAA ACATTGTTTT ATGAATATTT AATCTGATTT TTTGGCTAGT 
TATTTTATTA TATCAAGGGT TCCTGTTTAT AGTTGAAAAC AGTTACTGTA TAGAAAATAG 
TGTCCCAATT TTCTCTCTTA AATAATATAT TAGTTAATAA AAGATATTTT AATATATTAG 
ATATACATAA TATCTAAAGC AACACATATT TAGACACAAC ACGTAATATC TTACTATTGT 
TTACATATAT TTATAGCTTA CCAATATAAC CCGTATCTAT GTTTTATAAG CTTTTATACA 
ATATATGTAC GGTATGCTGT CCACGTATAT ATATTCTCCA AAAAAAACGC ATGGTACACA 
AAATTTATTA AATATTTGGC AATTGGGTGT TTATCTAAAG TTTATCACAA TATTTATCAA 
CTATAATAGA TGGTAGAAGA TAAAAAAATT ATATCAGATT GATTCAATTA AATTTTATAA 
TATATCATTT TAAAAAATTA ATTAAAAGAA AACTATTTCA TAAAATTGTT CAAAAGATAA 
TTAGTAAAAT TAATTAAATA TGTGATGCTA TTGAGTTATA GAGAGTTATT GTAAATTTAC 
TTAAAATCAT ACAAATCTTA TCCTAATTTA ACTTATCATT TAAGAAATAC AAAAGTAAAA 
AACGCGGAAA GCAATAATTT ATTTACCTTA TTATAACTCC TATATAAAGT ACTCTGTTTA 
TTCAACATAA TCTTACGTTG TTGTATTCAT AGGCATCTTT AACCTATCTT TTCATTTTCT 
GATCTCGATC GTTTTCGATC CAACAAAATG AGTCTACCGG TGAGGAACCA AGAGGTGATT 
ATGCAGATTC CTTCTTCTTC TCAGTTTCCA GCAACATCGA GTCCGGAAAA CACCAATCAA 
GTGAAGGATG AGCCAAATTT GTTTAGACGT GTTATGAATT TGCTTTTACG TCGTAGTTAT 
TGAAAAAGCT GATTTATCGC ATGATTCAGA ACGAGAAGTT GAAGGCAAAT AACTAAAGAA 
GTCTTTTATA TGTATACAAT AATTGTTTTT AAATCAAATC CTAATTAAAA AAATATATTC 
ATTATGACTT TCATGTTTTT AATGTAATTT ATTCCTATAT CTATAATGAT TTTGTTGTGA 
AGAGCGTTTT CATTTGCTAT AGAACAAGGA GAATAGTTCC AGGAAATATT CGACTTGATT 
TAATTATAGT GTAAACATGC TGAACACTGA AAATTACTTT TTCAATAAAC GAAAAATATA 
ATATACATTA CAAAACTTAT GTGAATAAAG CATGAAACTT AATATACGTT CCCTTTATCA 
TTTTACTTCA AAGAAAATAA ACAGAAATGT AACTTTCACA TGTAAATCTA ATTCTTAAAT 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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TTAAAAAATA ATATTTATAT ATTTATATGA AAATAACGAA CCGGATGAAA AATAAATTTT 

ATATATTTAT ATCATCTCCA AATCTAGTTT GGTTCAGGGG CTTACCGAAC CGGATTGAAC 

TTCTCATATA CAAAAATTAG CAACACAAAA TGTCTCCGGT ATAAATACTA ACATTTATAA 

CCCGAACCGG TTTAGCTTCC TGTTATATCT TTTTAAAAAA GATCTCTGAC AAAGATTCCT 

TTCCTGGAAA TTTACCGGTT TTGGTGAAAT GTAAACCGTG GGACGAGGAT GCTTCTTCAT 

ATCTCACCAC CACTCTCGTT GACTTGACTT GGCTCTGCTC GTCAATGGTT ATCTTCGATC 

TTTAACCAAA TCCAGTTGAT AAGGTCTCTT CGTTGATTAG CAGAGATCTC TTTAATTTGT 

GAATTTCAAT TCATCGGAAC CTGTTG ATG GAC ACC ACC ATT GAT GGA TTC GCC 

Met Asp Thr Thr He Asp Gly Phe Ala 
1 ~ 5 

GAT TCT TAT GAA ATC AGC AGC ACT AGT TTC GTC GCT ACC GAT AAC ACC 
Asp Ser Tyr Glu He Ser Ser Thr Ser Phe Val Ala Thr Asp Asn Thr 
10 15 20 25 

GAC TCC TCT ATT GTT TAT CTG GCC GCC GAA CAA GTA CTC ACC GGA CCT 
Asp Ser Ser He Val Tyr Leu Ala Ala Glu Gin Val Leu Thr Gly Pro 
30 35 40 

GAT GTA TCT GCT CTG CAA TTG CTC TCC AAC AGC TTC GAA TCC GTC TTT 
Asp Val Ser Ala Leu Gin Leu Leu Ser Asn Ser Phe Glu Ser Val Phe 
45 50 55 

GAC TCG CCG GAT GAT TTC TAC AGC GAC GCT AAG CTT GTT CTC TCC GAC 
Asp Ser Pro Asp Asp Phe Tyr Ser Asp Ala Lys Leu Val Leu Ser Asp 
60 " 65 70 

GGC CGG GAA GTT TCT TTC CAC CGG TGC GTT TTG TCA GCG AGA AGC TCT 
Gly Arg Glu Val Ser Phe His Arg Cys Val Leu Ser Ala Arg Ser Ser 
75 80 85 

TTC TTC AAG AGC GCT TTA GCC GCC GCT AAG AAG GAG AAA GAC TCC AAC 
Phe Phe Lys Ser Ala Leu Ala Ala Ala Lys Lys Glu Lys Asp Ser Asn 
90 95 100 105 

AAC ACC GCC GCC GTG AAG CTC GAG CTT AAG GAG ATT GCC AAG GAT TAC 
Asn Thr Ala Ala Val Lys Leu Glu Leu Lys Glu He Ala Lys Asp Tyr 
110 115 120 

GAA GTC GGT TTC GAT TCG GTT GTG ACT GTT TTG GCT TAT GTT TAC AGC 
Glu Val Gly Phe Asp Ser Val Val Thr Val Leu Ala Tyr Val Tyr Ser 
125 130 135 

AGC AGA GTG AGA CCG CCG CCT AAA GGA GTT TCT GAA TGC GCA GAC GAG 
Ser Arg Val Arg Pro Pro Pro Lys Gly Val Ser Glu Cys Ala Asp Glu 
140 145 150 

AAT TGC TGC CAC GTG GCT TGC CGG CCG GCG GTG GAT TTC ATG TTG GAG 
Asn Cys Cys His Val Ala Cys Arg Pro Ala Val Asp Phe Met Leu Glu 
155 160 165 

GTT CTC TAT TTG GCT TTC ATC TTC AAG ATC CCT GAA TTA ATT ACT CTC 
Val Leu Tyr Leu Ala Phe He Phe Lys He Pro Glu Leu He Thr Leu 
170 175 180 185 

TAT CAG GTAAAACACC ATCTGCATTA AGCTATGGTT ACACATTCAT GAATATGTTC 
Tyr Gin 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2813 

2861 

2909 

2957 

3005 

3053 

3101 

3149 

3197 

3245 

3293 

3341 

3397 
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TTACTTGAGT ACTTGTATTT GTATTTCAG AGG CAC TTA TTG GAC GTT GTA GAC 3450 

Arg His Leu Leu Asp Val Val Asp 
190 195 

AAA GTT GTT ATA GAG GAC ACA TTG GTT ATA CTC AAG CTT GCT AAT ATA 3498 
Lys Val Val He Glu Asp Thr Leu Val He Leu Lys Leu Ala Asn He 
200 205 210 

TGT GGT AAA GCT TGT ATG AAG CTA TTG GAT AGA TGT AAA GAG ATT ATT 3546 
Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys Glu He He 
215 ~ 220 225 

GTC AAG TCT AAT GTA GAT ATG GTT AGT CTT GAA AAG TCA TTG CCG GAA 3594 
Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser Leu Pro Glu 
230 235 240 

GAG CTT GTT AAA GAG ATA ATT GAT AGA CGT AAA GAG CTT GGT TTG GAG 3642 
Glu Leu Val Lys Glu He He Asp Arg Arg Lys Glu Leu Gly Leu Glu 
245 " 250 255 

GTA CCT AAA GTA AAG AAA CAT GTC TCG AAT GTA CAT AAG GCA CTT GAC 3690 
Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp 
260 "* 265 270 275 

TCG GAT GAT ATT GAG TTA GTC AAG TTG CTT TTG AAA GAG GAT CAC ACC 3738 
^ Ser Asp Asp He Glu Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr 

280 285 290 



Ui AAT CTA GAT GAT GCG TGT GCT CTT CAT TTC GCT GTT GCA TAT TGC AAT 3786 

Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn 
m 295 300 305 

h- GTG AAG ACC GCA ACA GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC AAC 3834 
T, Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn 
IV 310 315 320 



CAT AGG AAT CCG AGG GGA TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG 3882 
His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg 
325 330 335 

AAG GAG CCA CAA TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA 3930 
Lys Glu Pro Gin Leu He Leu Ser Leu Leu Glu Lys Gly Ala Ser Ala 
340 345 350 355 

TCA GAA GCA ACT TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA 3978 
Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met He Ala Lys Gin 
360 365 370 

GCC ACT ATG GCG GTT GAA TGT AAT AAT ATC CCG GAG CAA TGC AAG CAT 4026 
Ala Thr Met Ala Val Glu Cys Asn Asn He Pro Glu Gin Cys Lys His 
375 380 385 

TCT CTC AAA GGC CGA CTA TGT GTA GAA ATA CTA GAG CAA GAA GAC AAA 4074 
Ser Leu Lys Gly Arg Leu Cys Val Glu He Leu Glu Gin Glu Asp Lys 
390 395 400 

CGA GAA CAA ATT CCT AGA GAT GTT CCT CCC TCT TTT GCA GTG GCG GCC 4122 
Arg Glu Gin He Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala 
405 410 415 

GAT GAA TTG AAG ATG ACG CTG CTC GAT CTT GAA AAT AGA G 4162 
Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 
420 425 430 
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M 

O 
w 

IP. 



GTATCTATCA AGTCTTATTT CTTATATGTT TGAATTAAAT TTATGTCCTC TCTATTAGGA 

AACTGAGTGA ACTAATGATA ACTATTCTTT GTGTCGTCCA CTGTTTAG TT GCA CTT 

Val Ala Leu 
435 

GCT CAA CGT CTT TTT CCA ACG GAA GCA CAA GCT GCA ATG GAG ATC GCC 
Ala Gin Arg Leu Phe Pro Thr Glu Ala Gin Ala Ala Met Glu He Ala 
440 445 450 

GAA ATG AAG GGA ACA TGT GAG TTC ATA GTG ACT AGC CTC GAG CCT GAC 
Glu Met Lys Gly Thr Cys Glu Phe He Val Thr Ser Leu Glu Pro Asp 
455 460 465 

CGT CTC ACT GGT ACG AAG AGA ACA TCA CCG GGT GTA AAG ATA GCA CCT 
Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys He Ala Pro 
470 ~ 475 480 

TTC AGA ATC CTA GAA GAG CAT CAA AGT AGA CTA AAA GCG CTT TCT AAA 
Phe Arg He Leu Glu Glu His Gin Ser Arg Leu Lys Ala Leu Ser Lys 
485 490 495 

ACC G GTATGGATTC TCACCCACTT CATCGGACTC CTTATCACAA AAAACAAAAC 

Thr 

500 

TAAATGATCT TTAAACATGG TTTTGTTACT TGCTGTCTGA CCTTGTTTTT TTTATCATCA 

G TG GAA CTC GGG AAA CGA TTC TTC CCG CGC TGT TCG GCA GTG CTC 
Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Ala Val Leu 
505 510 515 

GAC CAG ATT ATG AAC TGT GAG GAC TTG ACT CAA CTG GCT TGC GGA GAA 
Asp Gin He Met Asn Cys Glu Asp Leu Thr Gin Leu Ala Cys Gly Glu 
520 525 530 

GAC GAC ACT GCT GAG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA 
Asp Asp Thr Ala Glu Lys Arg Leu Gin Lys Lys Gin Arg Tyr Met Glu 
535 540 545 

ATA CAA GAG ACA CTA AAG AAG GCC TTT AGT GAG GAC AAT TTG GAA TTA 
He Gin Glu Thr Leu Lys Lys Ala Phe Ser Glu Asp Asn Leu Glu Leu 
550 555 560 

GGA AAT TCG TCC CTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC 
Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys Ser Thr 
565 570 575 

GGT GGA AAG AGG TCT AAC CGT AAA CTC TCT CAT CGT CGT CGG TGA 
Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg Arg * 
580 ~ 585 590 

GACTCTTGCC TCTTAGTGTA ATTTTTGCTG TACCATATAA TTCTGTTTTC ATGATGACTG 
TAACTGTTTA TGTCTATCGT TGGCGTCATA TAGTTTCGCT CTTCGTTTTG CATCCTGTGT 
ATTATTGCTG CAGGTGTGCT TCAAACAAAT GTTGTAACAA TTTGAACCAA TGGTATACAG 
ATTTGTAATA TATATTTATG TACATCAACA ATAACCCATG ATGGTGTTAC AGAGTTGCTA 
GAATCAAAGT GTGAAATAAT GTCAAATTGT TCATCTGTTG GATATTTTCC ACCAAGAACC 
AAAAGAATAT TCAAGTTCCC TGAACTTCTG GCAACATTCA TGTTATATGT ATCTTCCTAA 
TTCTTCCTTT AACCTTTTGT AACTCGAATT ACACAGCAAG TTAGTTTCAG GTCTAGAGAT 
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Ala Ala Glu Gin Val Leu Thr Gly Pro Asp Val Ser Ala Leu Gin Leu 
35 40 45 



Leu Ser Asn Ser Phe Glu Ser Val Phe Asp Ser Pro Asp Asp Phe Tyr 
« 50 55 60 



Ser Asp Ala Lys Leu Val Leu Ser Asp Gly Arg Glu Val Ser Phe His 
65 70 75 80 

Arq Cys Val Leu Ser Ala Arg Ser Ser Phe Phe Lys Ser Ala Leu Ala 
85 90 95 

Ala Ala Lys Lys Glu Lys Asp Ser Asn Asn Thr Ala Ala Val Lys Leu 
100 ~ 105 110 

Glu Leu Lys Glu He Ala Lys Asp Tyr Glu Val Gly Phe Asp Ser Val 
115 120 125 

Val Thr Val Leu Ala Tyr Val Tyr Ser Ser Arg Val Arg Pro Pro Pro 
130 135 140 

Lys Gly Val Ser Glu Cys Ala Asp Glu Asn Cys Cys His Val Ala Cys 
145 " 150 155 160 

Arg Pro Ala Val Asp Phe Met Leu Glu Val Leu Tyr Leu Ala Phe He 
165 170 175 

Phe Lys He Pro Glu Leu He Thr Leu Tyr Gin Arg His Leu Leu Asp 
180 185 190 

Val Val Asp Lys Val Val He Glu Asp Thr Leu Val He Leu Lys Leu 
195 200 205 
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5655 



AAGAGAACAC TGAGTGGGCG TGTAAGGTGC ATTCTCCTAG TCAGCTCCAT TGCATCCAAC 5346 

ATTTGTGAAT GACACAAGTT AACAATCCTT TGCACCATTT CTGGGTGCAT ACATGGAAAC 5406 

TTCTTCGATT GAAACTTCCC ACATGTGCAG GTGCGTTCGC TGTCACTGAT AGACCAAGAG 5466 

ACTGAAAGCT TTCACAAATT GCCCTCAAAT CTTCTGTTTC TATCGTCATG ACTCCATATC 5526 

TCCGACCACT GGTCATGAGC CAGAGCCCAC TGATTTTGAG GGAATTGGGC TAACCATTTC 5586 

CGAGCTTCTG AGTCCTTCTT TTTGATGTCC TTTATGTAGG AATCAAATTC TTCCTTCTGA 5646 
CTTGTGGAT 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

P (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

"'si 

43 Met Asp Thr Thr He Asp Gly Phe Ala Asp Ser Tyr Glu He Ser Ser 
O 1 5 10 15 

hi 

m Thr Ser Phe Val Ala Thr Asp Asn Thr Asp Ser Ser He Val Tyr Leu 

Ul 20 25 30 

PR-' 



Ala Asn He Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys 
210 215 220 

Glu He He Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser 
225 230 235 240 

Leu Pro Glu Glu Leu Val Lys Glu He He Asp Arg Arg Lys Glu Leu 
245 250 255 

Gly Leu Glu Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys 
260 265 270 

Ala Leu Asp Ser Asp Asp He Glu Leu Val Lys Leu Leu Leu Lys Glu 
275 280 285 

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala 
290 295 300 

Tyr Cys Asn Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala 
305 310 315 320 

Asp Val Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala 
325 330 335 

Ala Met Arg Lys Glu Pro Gin Leu He Leu Ser Leu Leu Glu Lys Gly 
340 345 350 

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met He 
355 360 365 

Ala Lys Gin Ala Thr Met Ala Val Glu Cys Asn Asn He Pro Glu Gin 
370 375 380 

Cys Lys His Ser Leu Lys Gly Arg Leu Cys Val Glu He Leu Glu Gin 
385 390 395 400 

Glu Asp Lys Arg Glu Gin He Pro Arg Asp Val Pro Pro Ser Phe Ala 
405 410 415 

Val Ala Ala Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 
420 - 425 430 

Val Ala Leu Ala Gin Arg Leu Phe Pro Thr Glu Ala Gin Ala Ala Met 
435 440 445 

Glu He Ala Glu Met Lys Gly Thr Cys Glu Phe He Val Thr Ser Leu 
450 455 460 

Glu Pro Asp Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys 
465 470 475 480 

He Ala Pro Phe Arg He Leu Glu Glu His Gin Ser Arg Leu Lys Ala 
485 490 495 

Leu Ser Lys Thr Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser 
500 505 510 

Ala Val Leu Asp Gin He Met Asn Cys Glu Asp Leu Thr Gin Leu Ala 
515 1 520 525 

Cys Gly Glu Asp Asp Thr Ala Glu Lys Arg Leu Gin Lys Lys Gin Arg 
530 535 540 

Tyr Met Glu He Gin Glu Thr Leu Lys Lys Ala Phe Ser Glu Asp Asn 
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545 



550 



555 



560 



Leu Glu Leu Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser 
565 570 575 

Lvs Ser Thr Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg 
580 585 590 



Arg 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



jSj. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

M He Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp He Glu Leu Val 

03 15 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
W 20 25 30 

in 

" Val His Tyr Ala Val Gin His Cys Asn 

m 35 40 

W (2) INFORMATION FOR SEQ ID NO: 5: 

fj5 (i) SEQUENCE CHARACTERISTICS: 

5? (A) LENGTH: 38 amino acids 

M (B) TYPE: amino acid 

El (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Pro Thr Gly Lys Thr Ala Leu His Leu Ala Ala Glu Met Val Ser Pro 
15 10 15 

Asp Met Val Ser Val Leu Leu Asp His His Ala Asp Xaa Asn Phe Arg 
20 25 30 

Thr Xaa Asp Gly Val Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
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(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

lie Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp lie Glu Leu Val 
1 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gin His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 
1 5 10 15 

Ser Pro Asp Met Val Ser Val Leu Leu Asp Gin 
20 25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

He Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp He Glu Leu Val 
1 5 *" 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gin His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 
15 10 15 

Ser Pro Asp Met Val Ser Val Leu Leu Asp Gin 
20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
0 (D) TOPOLOGY: not relevant 

,5 (ii) MOLECULE TYPE: peptide 

p 

01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

0 He Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp He Glu Leu Val 

m 1 ' 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 

% 20 25 30 

HI Val His Tyr Ala Val Gin His Cys Asn 

35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Thr Gly Lys Thr Ala Leu His Leu Ala Ala Glu Met Val Ser Pro 
15 10 15 

Asp Met Val 
(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AATTCTAAAG CATGCCGATC GG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AATTCCGATC GGCATGCTTT A 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AATTCTAAAC CATGGCGATC GG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 
AATTCCGATC GCCATGGTTT A 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCAGCTGGAA TTCCG 15 
(2) INFORMATION FOR SEQ ID NO: 17: 



H (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 
yp (B) TYPE: nucleic acid 

g (C) STRANDEDNESS: single 

H (D) TOPOLOGY: linear 

II! (ii) MOLECULE TYPE: other nucleic acid 

3 (A) DESCRIPTION: /desc = "oligonucleotide" 

Pi 

na 

\n (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAATTCCA GCTGGCATG 19 
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